Adaptive Group Sparsity for Non-Negative Matrix Factorization with Application to Unsupervised Source Separation
Authors
Abstract
Non-negative matrix factorization (NMF) is an appealing technique for many audio applications, such as automatic music transcription, source separation, and speech enhancement. Sparsity constraints are commonly imposed on the NMF model to discover a small number of dominant patterns. Recently, group sparsity has been proposed for NMF-based methods, in which basis vectors belonging to the same group are permitted to activate together, while activations across groups are suppressed. However, most group sparsity models penalize all groups with the same parameter, without considering the relative importance of different groups for modeling the input data. In this paper, we propose adaptive group sparsity, which models the relative importance of different groups with adaptive penalty parameters, and investigate its potential benefit for separating speech from other sound sources. Experimental results show that the proposed adaptive group sparsity improves performance over regular group sparsity in unsupervised settings where neither the speaker identity nor the type of noise is known in advance.
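The idea of penalizing each group with its own parameter can be illustrated with a minimal sketch. It is not the paper's algorithm: the abstract gives neither the cost function nor the update rules, so the Euclidean cost, the per-frame l2 group norm, the heuristic multiplicative updates, and the rule that sets each group's penalty inversely to its relative activation energy are all assumptions made here for illustration. The names `group_norms`, `adaptive_weights`, `group_sparse_nmf`, and `base` are hypothetical.

```python
import numpy as np


def group_norms(H, groups):
    """Per-frame l2 norm of each group's activations, shape (n_groups, n_frames)."""
    return np.stack([np.sqrt((H[idx] ** 2).sum(axis=0)) for idx in groups])


def adaptive_weights(H, groups, base=0.1, eps=1e-9):
    """Assumed rule: groups with more activation energy receive a smaller penalty."""
    energy = group_norms(H, groups).sum(axis=1)   # total energy per group
    relative = energy / (energy.mean() + eps)     # importance relative to the mean
    return base / (relative + eps)


def group_sparse_nmf(V, groups, n_iter=200, base=0.1, adaptive=True, seed=0, eps=1e-9):
    """Factorize V ~= W @ H with a group-sparsity penalty on the activations H.

    groups is a list of index arrays; each array selects the rows of H
    (and columns of W) that belong to one group.
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    n_basis = sum(len(idx) for idx in groups)
    W = rng.random((n_freq, n_basis)) + eps
    H = rng.random((n_basis, n_frames)) + eps
    lam = np.full(len(groups), base)              # one shared value if not adaptive

    for _ in range(n_iter):
        if adaptive:
            lam = adaptive_weights(H, groups, base, eps)

        # Gradient of sum_g lam_g * sum_t ||H_g[:, t]||_2 with respect to H.
        norms = group_norms(H, groups)
        penalty_grad = np.zeros_like(H)
        for g, idx in enumerate(groups):
            penalty_grad[idx] = lam[g] * H[idx] / (norms[g] + eps)

        # Heuristic multiplicative updates; the penalty gradient enters the denominator.
        H *= (W.T @ V) / (W.T @ W @ H + penalty_grad + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)

        # Normalize basis columns so the penalty cannot be evaded by rescaling.
        scale = W.sum(axis=0, keepdims=True) + eps
        W /= scale
        H *= scale.T

    return W, H, lam


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    V = rng.random((20, 30))                      # stand-in for a magnitude spectrogram
    groups = [np.arange(0, 4), np.arange(4, 8)]   # two groups of four basis vectors
    W, H, lam = group_sparse_nmf(V, groups, n_iter=50)
    print("per-group penalties:", lam)
```

Setting `adaptive=False` keeps a single shared penalty for all groups, which corresponds to the regular group sparsity baseline the abstract compares against.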
Similar resources
Non-Negative Matrix Factorization with Sparsity Learning for Single Channel Audio Source Separation
Iterative Weighted Non-smooth Non-negative Matrix Factorization for Face Recognition
Non-negative Matrix Factorization (NMF) is a part-based image representation method. It comes from the intuitive idea that an entire face image can be constructed by combining several parts. In this paper, we propose a framework for face recognition by finding localized, part-based representations, denoted “Iterative weighted non-smooth non-negative matrix factorization” (IWNS-NMF). A new cost fun...
Source-filter Based Clustering for Monaural Blind Source Separation
In monaural blind audio source separation scenarios, a signal mixture is usually separated into more signals than there are active sources. It is therefore necessary to group the separated signals into the final source estimates. Traditionally, grouping methods are supervised and thus need a learning step on appropriate training data. In contrast, we discuss unsupervised clustering of the separated channe...
An Experimental Survey on Non-Negative Matrix Factorization for Single Channel Blind Source Separation
In applications such as speech and audio denoising, music transcription, and music- and audio-based forensics, it is desirable to decompose a single-channel recording into its respective sources, a task commonly referred to as blind source separation (BSS). One of the techniques used in BSS is non-negative matrix factorization (NMF). In NMF, both supervised and unsupervised modes of operation are used. Among...
Robust Non-Negative Matrix Factorization for Multispectral Data with Sparse Prior
In this work, we study Non-Negative Matrix Factorization (NMF) and compare standard algorithms with an extension to NMF of a Blind Source Separation algorithm using sparsity, Generalized Morphological Component Analysis (GMCA). We also develop a more robust version of GMCA handling more precisely the priors through sub-iterations, which we call rGMCA. We present preliminary results showing GMCA...